# Decoupled architecture
Visionreasoner 7B
Apache-2.0
VisionReasoner-7B is an image-text-to-text model that adopts a decoupled architecture and consists of a reasoning model and a segmentation model. It can interpret user intentions and generate pixel-level masks.
Image-to-Text
Transformers English

V
Ricky06662
2,398
1
Seg Zero 7B Best On ReasonSegTest
Other
Seg-Zero-7B is an image segmentation model based on reasoning chain guidance, featuring a decoupled architecture that includes a reasoning model and a segmentation model. It achieves zero-shot generalization capabilities through GRPO reinforcement learning training.
Image Segmentation
Transformers English

S
Ricky06662
724
0
Featured Recommended AI Models